Errata for "Building Efficient Query Engines in a High-Level Language" (PVLDB 7(10): 853-864)
نویسندگان
چکیده
This is in response to recent feedback from our peers that calls for a number of clarifications regarding the experimental section of our paper. Table 1 clarifies which optimizations are used in each evaluated flavor of LegoBase. HyPer-simulated is a configuration of the LegoBase codebase that mirrors the HyPer system as closely as possible by just activating some of the main optimizations and deactivating others: using the push engine, with operator inlining active but data structure optimizations and data layout transformations turned off. In addition, as noted in our paper (footnote 8), the actual HyPer system uses query plans generated by its own query optimizer while HyPer-simulated uses query plans from DBX. We use TPC-H queries and generated data at scaling factor 8 to evaluate the impact of our compilation techniques. For all evaluated systems, reported query evaluation times only include the execution time of the query and exclude the time taken for query optimization/compilation and loading the data into the main-memory data structures. We note that the data structure specialization optimizations of Section 3 of our paper (whenever used) are a gentle form of pre-computation, as would arguably be loading row data from disk into a main-memory column store.
منابع مشابه
Building Efficient Query Engines in a High-Level Language
In this paper we advocate that it is time for a radical rethinking of database systems design. Developers should be able to leverage high-level programming languages without having to pay a price in efficiency. To realize our vision of abstraction without regret, we present LegoBase, a query engine written in the high-level programming language Scala. The key technique to regain efficiency is t...
متن کاملWETSUIT: An Efficient Mashup Tool for Searching and Fusing Web Entities
We demonstrate a new powerful mashup tool called WETSUIT (Web EnTity Search and fUsIon Tool) to search and integrate web data from diverse sources and domain-specific entity search engines. WETSUIT supports adaptive search strategies to query sets of relevant entities with a minimum of communication overhead. Mashups can be composed using a set of high-level operators based on the Javacompatibl...
متن کاملDomain Specific Multi-stage Query Language for Medical Document Repositories
Vast amount of medical information is increasingly available on the Web. As a result, seeking medical information through queries is gaining importance in the medical domain. The existing keyword-based search engines such as Google, Yahoo fail to suffice the needs of the health-care workers (who are well-versed with the domain knowledge required for querying) using these they often face results...
متن کاملChanging Engines in Midstream: A Java Stream Computational Model for Big Data Processing
With the addition of lambda expressions and the Stream API in Java 8, Java has gained a powerful and expressive query language that operates over in-memory collections of Java objects, making the transformation and analysis of data more convenient, scalable and efficient. In this paper, we build on Java 8 Stream and add a DistributableStream abstraction that supports federated query execution o...
متن کاملDeveloping a BIM-based Spatial Ontology for Semantic Querying of 3D Property Information
With the growing dominance of complex and multi-level urban structures, current cadastral systems, which are often developed based on 2D representations, are not capable of providing unambiguous spatial information about urban properties. Therefore, the concept of 3D cadastre is proposed to support 3D digital representation of land and properties and facilitate the communication of legal owners...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- PVLDB
دوره 7 شماره
صفحات -
تاریخ انتشار 2014